Automated Classification of Vowel Category and Speaker Type in the High-Frequency Spectrum
Abstract
The high-frequency region of vowel signals (above the third formant, F3) has received little research attention. Recent evidence, however, has documented the perceptual utility of high-frequency information in the speech signal, above the traditional frequency bandwidth known to contain important cues for speech and speaker recognition. The purpose of this study was to determine whether high-pass filtered vowels could be separated by vowel category and speaker type in a supervised learning framework. Mel-frequency cepstral coefficients (MFCCs) were extracted from productions of six vowel categories produced by two male, two female, and two child speakers. Results revealed that the filtered vowels were well separated by vowel category and speaker type using MFCCs from the high-frequency spectrum. This result demonstrates that the high-frequency region carries information useful for automated classification, and this is the first study to report findings of this nature in a supervised learning framework.
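The pipeline described above (restrict the signal to the band above F3, extract MFCCs, classify with labeled data) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the study: the 3500 Hz cutoff, the 16 kHz sample rate, the filter and coefficient counts, the synthetic tone mixtures standing in for vowels, and the nearest-centroid classifier standing in for whatever supervised learner the authors used. The high-pass step is approximated by placing the mel filters only above the cutoff rather than by filtering the waveform.

```python
import math

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Naive DFT power spectrum (bins 0..N/2) of a Hamming-windowed frame."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        spec.append((re * re + im * im) / n)
    return spec

def high_band_mfcc(frame, sample_rate, cutoff_hz=3500.0, n_filters=12, n_coeffs=6):
    """MFCC-like coefficients from triangular mel filters placed above cutoff_hz.

    cutoff_hz, n_filters and n_coeffs are illustrative defaults (assumptions).
    """
    n = len(frame)
    spec = power_spectrum(frame)
    lo, hi = hz_to_mel(cutoff_hz), hz_to_mel(sample_rate / 2.0)
    # n_filters + 2 edge frequencies, evenly spaced on the mel scale
    edges = [mel_to_hz(lo + (hi - lo) * j / (n_filters + 1))
             for j in range(n_filters + 2)]
    log_energies = []
    for j in range(1, n_filters + 1):
        left, centre, right = edges[j - 1], edges[j], edges[j + 1]
        energy = 0.0
        for k, p in enumerate(spec):
            f = k * sample_rate / n
            if left < f <= centre:
                energy += p * (f - left) / (centre - left)
            elif centre < f < right:
                energy += p * (right - f) / (right - centre)
        log_energies.append(math.log(energy + 1e-10))
    # DCT-II of the log filterbank energies gives the cepstral coefficients
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                for j, e in enumerate(log_energies))
            for c in range(n_coeffs)]

def synth(freqs, sample_rate=16000, n=256):
    """Hypothetical stand-in for a vowel frame: a sum of pure tones."""
    return [sum(math.sin(2 * math.pi * f * i / sample_rate) for f in freqs)
            for i in range(n)]

def nearest_centroid(train, query):
    """train maps label -> list of feature vectors; returns the closest label."""
    best_label, best_dist = None, float("inf")
    for label, vectors in train.items():
        centroid = [sum(col) / len(vectors) for col in zip(*vectors)]
        dist = sum((a - b) ** 2 for a, b in zip(centroid, query))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label

SR = 16000  # assumed sample rate
# Two made-up classes that share a 500 Hz component and differ only above the cutoff.
train = {
    "A": [high_band_mfcc(synth([500, f]), SR) for f in (4400, 4600)],
    "B": [high_band_mfcc(synth([500, f]), SR) for f in (6400, 6600)],
}
for probe in (4500, 6500):
    features = high_band_mfcc(synth([500, probe]), SR)
    print(probe, nearest_centroid(train, features))
```

Because both made-up classes share the same low-frequency component, any separation the classifier achieves in this toy setup has to come from the band above the cutoff, which mirrors the question the abstract poses. Real vowel recordings, a proper high-pass filter, and a stronger classifier would be needed to say anything about actual speech.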